Mining for Relevant Terms from Log Files
نویسندگان
چکیده
The Information extracted from log files of computing systems can be considered one of the important resources of information systems. In the case of Integrated Circuit design, log files generated by design tools are not exhaustively exploited. The logs of this domain are multi-source, multi-format, and have a heterogeneous and evolving structure. Moreover, they usually do not respect the grammar and the structures of natural language though they are written in English. According to features of such textual data, applying the classical methods of information extraction is not an easy task, more particularly for terminology extraction. We have previously introduced EXTERLOG approach to extract the terminology from such log files. In this paper, we introduce a new developed version of EXTERLOG guided by Web. We score the extracted terms by a Web and context based measure. We favor the more relevant terms of domain and emphasize the precision by filtering terms based on their scores. The experiments show that EXTERLOG is well-adapted terminology extraction approach from log files.
منابع مشابه
Study on Various Web Mining Functionalities using Web Log Files
As the size of web increases along with number of users, it is very much essential for the website owners to better understand their customers so that they can provide better service, and also enhance the quality of the website. To achieve this they depend on the web access log files. The web access log files can be mined to extract interesting pattern so that the user behavior can be understoo...
متن کاملFrequent Pattern Mining of Web Log Files Working Principles
Frequent pattern mining plays a major role in mining of web log files. Web usage mining is the one of the web mining process that involves application of mining techniques to web server logs to extract the behavior of users. A web usage mining consists of three important phases: data preprocessing, patterns discovery and pattern analysis. In data preprocessing phase the unwanted data are remove...
متن کاملLearning about Online Learning Processes and Students' Motivation through Web Usage Mining
This study illustrates the potential of applying Web usage mining the analysis of Web log files in educational research. It consists of two sub-studies and focuses on two types of analysis, both related to the whole learning process: investigating one learner's activity in order to learn about her or his learning process, and examining the activity of a large group of learners, in order to deve...
متن کاملExploitation of Server Log Files of User Behavior in Order to Inform Administrator
Requests that users are sent to the Web server all in server log files are stored in a file called. It can be stated that users will see all pages or all user requests to the server, the file is stored. In this paper we investigate the behavior of users through web usage mining techniques and using server log files to help service administrators paid websites. At the end a web site downloading ...
متن کاملLOGDIG log file analyzer for mining expected behavior from log files
Log files are often the only way to identify and locate errors in a deployed system. This paper presents a new log file analyzing framework, LOGDIG, for checking expected system behavior from log files. LOGDIG is a generic framework, but it is motivated by logs that include temporal data (timestamps) and system-specific data (e.g. spatial data with coordinates of moving objects), which are pres...
متن کامل